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Landauer's principle states that the erasure of one bit of information requires the free energy 
fcTln2. We argue that the reliability of the bit erasure process is bounded by the accuracy inherent 
in the statistical state of the energy source ('the resources') driving the process. We develop a general 
framework describing the 'thermodynamic worth' of the resources with respect to reliable bit erasure 
or good cooling. This worth turns out to be given by the distinguishability of the resource's state 
from its equilibrium state in the sense of a statistical inference problem. Accordingly, Kullback- 
Leibler relative information is a decisive quantity for the 'worth' of the resource's state. Due to 
the asymmetry of relative information, the reliability of the erasure process is bounded rather by 
the relative information of the equilibrium state with respect to the actual state than by the relative 
information of the actual state with respect to the equilibrium state (which is the free energy up to 
constants). 



I. INTRODUCTION 

One of the characteristic features of technological 
progress is the increase of human ability to control and 
design the microscopic world. Especially the recent suc- 
cesses in manipulating simple quantum systems (for ex- 
ample in the context of Quantum Computing research) 
are one aspect of this general development. Since every 
process controlling microscopic particles is disturbed by 
heat, this progress is strongly connected with the inven- 
tion of efficient cooling mechanisms (see jjj, This 
statement is in some senseQ a tautological one: Prepar- 
ing a physical system in a pure quantum state means 
preparing a state without entropy, i.e., a system with- 
out heat. In present day cooling techniques, the size of 
the required apparatus is quite impressive compared to 
the tininess of the cooled systems. In contrast, minia- 
turization in computer technology will require smaller, 
efficient and power saving mechanisms for draining off en- 
tropy on the nanoscopic or microscopic level. This raises 
the question for fundamental lower bounds on the re- 
sources needed for cooling simple quantum systems. At 
first sight the answer seems to be given by well-known 
thermodynamic theory, in particular the Second Law: 
Extracting the entropy S from a system requires the en- 
ergy SkT where k is Boltzmann's constant and T the 
temperature of the surrounding heat bath absorbing the 
entropy. Another formulation of this law is Landauer's 
principle saying that the erasure or initialization of one 
bit being in an unknown state requires at least the en- 
ergy ln2fcT (see @], §], g]). But this cannot be the 
complete answer: To understand the fundamental lim- 
itations on scaling down the cooling apparatus and re- 
ducing the resources, we model the cooling process as 



an energy conserving unitary dynamics on the composi- 
tion of the considered quantum system with another one 
('the resources'). Within this microscopic model we do 
not expect that necessary and sufficient conditions for 
the resource's quantum state to enable effective cooling 
procedures are given by well-known laws of thermody- 
namics. 

Of course, a lot of steps have already been made to- 
wards a refinement of thermodynamics on the level of 
low-dimensional quantum systems (see e.g. 0, [||). Ac- 
tually, one should reckon all the results concerning en- 
tanglement purification ^| , quantum error correction (Tc[ | 
[ 1 1 1 , quantum data compression |l2| ] , and logical cooling 
[13 1 as such since they are dealing essentially with the 
transport and concentration of information by operations 
on compositions of simple quantum systems. Neverthe- 
less, our approach is rather different from those ones: 
Our microphysical models of cooling include the energy 
source - a quantum system as well - driving the pro- 
cess, i.e., we restrict the class of unitary transformations 
to those conserving the total Hamiltonian of the system. 
This setup emphasizes the fact, that we want to develop 
a theory of thermodynamics in contrast to a pure theory 
of information: The latter one deals with information 
only, while the first one focuses on the relation between 
energy and information. 

Some consequences of the restriction to energy conserv- 
ing transformations can be illustrated easily: Consider a 
bipartite quantum system consisting of a harmonic os- 
cillator with frequency v and a two-level system with 
energy levels and hv. Assume both systems to be in 
their equilibrium states for the same (finite) temperature. 
Then one can easily construct unitary transformations on 
the composite Hilbert space extracting entropy from the 
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x We use the cautious formulation 'in some sense' because of the following objection: If the system has a large energy gap 
between its ground state and the first excited state, it is in an almost pure state even for not too low temperatures. 
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two-level system and pumping into the oscillator. One 
can even show, that there are no bounds on the efficiency 
of such a cooling process, i.e., the state of the two-level 
system can be prepared arbitrarily close to a pure one. 
In contrast, there is no energy conserving unitary trans- 
formation changing the state of the system at all. Such 
a process would even violate the Second Law, since this 
would be a dynamics producing free energy without the 
use of an additional energy source. Accordingly, if the 
state of the harmonic oscillator differs slightly from its 
equilibrium state we will expect that an energy preserv- 
ing process can only have a slight cooling effect. Lead by 
this intuition, we investigate in which way the size of the 
deviation of the quantum system's state from its equi- 
librium state determines its 'thermodynamic worth' for 
enabling good cooling processes, or more generally, for 
precise preparation of quantum states. Reformulated in 
the spirit of the 'thermodynamics of computation', we in- 
vestigate the minimal resource requirements for a reliable 
bit erasure process. 

The paper is organized as follows: In section II we 
give a short introduction into thermal equilibrium states 
of quantum systems. In section III we present the formal 
setup of the microscopic cooling process and give nec- 
essary and sufficient conditions for the resource's state 
to allow for cooling a two-level system. In section IV 
we introduce a more flexible model in which cooling is 
described by a unitary dynamics on a tripartite system: 
The resources, the environment being in thermal equi- 
librium, and the two-level system to be cooled down. 
We prove that cooling is possible if and only if the time 
average of the resource's state does not agree with its 
equilibrium state. If the temperature of the two-level 
system is already below the environment's temperature, 
the deviation of the resource's state from equilibrium de- 
termines whether it is possible to cool the qubit even 
more. The second part of this section answers the totally 
different question of the lowest qubit-temperature which 
can be obtained by using the given resources if the qubit 
has initially the same temperature as the environment. 
We show that the determination of the lowest obtain- 
able temperature can be reduced to a quantum inference 
problem, namely the determination of error probabilities 
of a decision rule for distinguishing the resource's state 
from its equilibrium state. Sections V-VII shows conse- 
quences of our theory and analyze in which sense they go 
beyond well-known laws of thermodynamics. 



II. THERMODYNAMIC BACKGROUND 

Let H be the finite or infinite dimensional Hilbert space 
of a quantum system and H a selfadjoint operator act- 
ing on H. representing its Hamiltonian. Then, for any 
temperature T the density matrix 

p T :=e- H /^/tr{e- H '^), 



where k is Boltzmann's constant, is called the ther- 
mal equilibrium state with temperature T provided that 
ir(e _ ' ff /( feT - ) ) exists. Note that we do not define temper- 
ature as a property of every state, but merely for those 
of the form described above. 

As usual, we will use the inverse temperature defined 
by (3 := l/(kT) and consider the class of states 

p p : = e-^/trie-P") 

for any (3 with — oo < [3 < oo. 

In the special case of a non-degenerate two-level system 
this implies that an inverse temperature can be assigned 
to any density matrix commuting with the Hamiltonian. 
For two diagonal-states the state with lower (3 is the hot- 
ter state. The fact, that heating up to a value (3 < 
decreases the entropy is the well-known phenomenon of 
temperature inversion Jl4| . In order to avoid confusion 
we emphasize that heating means here increasing the oc- 
cupation probability for the upper state. This is con- 
nected with an increase of entropy for f3 > and a de- 
crease of entropy for (3 < 0. This unusual connection 
between entropy and heat due to temperature inversion 
might be confusing. However, we will mostly focus on 
cooling, since the corresponding statements for heating 
in our sense can be obtained analogously. In contrast, if 
one considers the maximally mixed state (T = oo) as the 
hottest one, there is no such analogy and the prepara- 
tion of the hottest states does not cause any difficulties 
comparable to the preparation of the coldest one. 

Since we want to interpret our results in the context of 
'thermodynamics of computation' we keep in mind that a 
two-level system can be considered as an one-bit-memory 
and any process producing an (almost) pure state from 
a mixed one will be considered as an erasure process of 
one unknown bit of information. 

In the following sections the dependence of the equi- 
librium states from the temperature will mostly not be 
mentioned explicitly, since troughout the paper we fix 
one common reference temperature T ^ 0,T ^ oo (and 
the corresponding inverse temperature f3) representing 
the temperature of the particle's environment. 

III. THE MODEL 

To investigate the ability of cooling or heating a multi- 
level quantum system within a precise mathematical 
framework, we introduce some terminology: Here, a 
guantum system is uniquely characterized by its Hamil- 
tonian H, since it determines in a unique way the cor- 
responding Hilbert space and its dynamics. Up to an 
irrelevant translation of the energy scale, for any fixed 
inverse temperature [3 there is an one-to-one correspon- 
dence between the system's Hamiltonian and its equilib- 
rium state. Note that any unitary operator u commutes 
with H if and only if it commutes with its equilibrium 
state provided that (3^0 and (3 ^ oo, i.e, a dynamics is 
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energy conserving if and only if it preserves the equilib- 
rium state. 

Every quantum system can be in different statistical 
states, described by a density matrix acting on the same 
Hilbert space as the Hamiltonian. We call a system being 
in a particular statistical state an object. More formally 
we define: 

Definition 1 

1. A (quantum) system is described by a density ma- 
trix 7 ( its 'equilibrium state ') acting on a finite di- 
mensional Hilbert space H. 

2. An object is a pair (p, 7) where p is a density ma- 
trix describing the actual mixed state of the system. 
Hence, a system in equilibrium is described by an 
object of the form 

(7.7)- 

Usually we will assume both matrices to have full 
rank. 

3. For two systems 7 and 7 we define the composed 
system as the system determined by the equilib- 
rium state 

7 <g> 7. 

4- For two objects O := (p, 7) and O := (p, 7) the 
composed object is defined to be 

OxO:=(/)8m8t) 

5. If u is a unitary operator acting on H with uju* = 
7, i.e, u is an 'energy conserving reversible dynam- 
ics ', we define the allowed transformation T u on 
the object O :— (p, 7) by: 

T„((p, 7 )) := (upu*, 7 ). 

In abuse of language, we will call u an allowed 
transformation as well. 

6. If a system 7 <g> 7 is in the state p ( where p is not 
a tensor product state necessarily) , we define the 
restriction of the object O := (p, 787) to its left, 
respectively right, component as 

Ot := (tr r (p),7) 

and 

O r := (irj(p),7), 

where tri and tr r denote the partial trace over the 
left, repectively right, component in the tensor prod- 
uct. 



Within this framework, the problem of cooling a two- 
level system ('qubit') by given resources can be formal- 
ized as follows: 

Given the arbitrary object O ('the resources') and the 
qubit Q := (a, a), with 

° ~ i + 1 e -0E dia 9(^ e ~ 0E )^ 

where E is the energy gap of the two-level system. Find 
an allowed transformation T u on 

OxQ 

which serves as a cooling process for Q, i.e., 

(T(0 X Q)) r 

is a qubit with a lower or higher temperature compared 
to the initial state a. 

Firstly we will look for those allowed transformations 
which minimize or maximize the occupation probability 
for the upper level. Let a z be the Pauli matrix 

o z := diag(l,-l) 

and assume the Hamiltonian of the qubit to be 

H := diag(0,E). 

Then the occupation probability for the upper level is 
maximized (respectively minimized) for those transfor- 
mations u which minimize (respectively maximize) the 
term 

tr(u(p ® a)u*{\ O o- z )). 

We find necessary and sufficient conditions for the 
transformations u to be optimal: 

Lemma 1 Let a be the density matrix of a bipartite sys- 
tem composed of a qubit with equilibrium state a as above 
and another arbitrary system with equilibrium state 7, 
i.e., we have the object 

(a, 7 (g> a). 

Let Pi be the spectral projections 0/7 ® cr. Assume that 
the following two conditions hold: 

1. All the operators PjCtPj commute with 1 (g) a z , 

so that we can divide the eigenvalues of the restric- 
tion of PjCtPj to the range of Pj into subsets T J + 

and T J _ corresponding to the eigenvectors of 1 <S> a z 
with eigenvalues +1 and —1, respectively. 

2. The smallest eigenvalue in T ] + is greater than the 
greatest eigenvalue in T 3 _. 
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Then there is no allowed transformation u on O x Q 
decreasing the occupation probability of the upper state, 
i.e., we have: 

tr(uau*{\ (g) o z )) > tr(a{\ ® a z )), 

for every unitary operator u with [u, 7 ® a] = 0. 

The Lemma will be proved in the appendix. 

For any allowed transformation u we can decide 
whether there can exist a better one for cooling by setting 
a := u(p®a)u* . Then Lemma [l] gives a criterion whether 
there can exist a better transformation u' . Furthermore 
it shows, that the optimal transformation for reducing 
the probability of the upper state or the lower state can 
always be chosen in such a way that the reduced density 
matrix of the qubit is still diagonal after one has per- 
formed the unitary transformation u. Therefore we can 
obtain an equilibrium state with a temperature different 
from the reference temperature. 

We shall use the following notion: 

Definition 2 Let Q := (cr,a) be a qubit in its equilib- 
rium state. We say 'the object O :— (p, 7) ' can be used 
for cooling Q if there is an allowed transformation T u on 
O x Q such that 

tr{u(p <g> er)u*(l ® a z )) > tr(pa z ). 

We say that it can be used for heating if we have '< ' 
instead of '> '. 

In order to give necessary and sufficient conditions for 
the possibility of cooling or heating the following sugges- 
tive definition turns out to be useful: 

Definition 3 For any object (/?, 7) let \i) and \j) be 

eigenvectors of 7. Let Ei and Ej be the corresponding 
eigenvalues of the system's Hamiltonian, i.e., 



Ej — Ei 



P 



Then the relative inverse temperature with respect 
to the states \i) and \j) is defined to be 



P\i),\ 



3) 



hx{i\p\i) - \n(j\p\j) 



Ej — Ei 



Similarly, we define the relative temperature 

1 



k(3\i),\j) 



Using this definition we have an easy criterion for the 
possibility of cooling: 



Theorem 1 An object O := {p,"f) can be used for cool- 
ing a qubit Q :— (a, a) with energy gap E and the in- 
verse temperature (3 if and only if there is a pair \i) and 
\j) of eigenvectors of the Hamiltonian H ( correspond- 
ing to j) with different eigenvalues Ei and Ej such that 
Ei — Ej = E and 

P\i),\j) > P- 

Proof: Assume P\i),\j) > P- Let |1), . . . , |^) be a ba- 
sis of eigenvectors of 7. Let |0) and 1) be the lower and 
upper state of the two-level system (In case of degener- 
ated levels the choice is irrelevant). Then the occupation 
probability for the ground state is given by 



W\o) = J2(j\p\j)(oW\o)- 



Now we perform the transformation u by permuting the 
states by the involution 



\i) ® |0) 



|J>®|1> 



and acting trivial on the other tensor product basis 
states. The probability for the lower state is changed 
by the amount 

tr{{u{p ® a)u* -{p® cr))(l ® |0)(0|)) 
= {j\p\j){l\a\l)-(S\p\m^)- 

The latter term is negative by assumption and due to the 
definition of (3 and P\i),\j)- 

Assume P\i),\j) < P- Clearly, for any j the spectral 
projection Pj can be written as 

Pj = (Q + ®|0)(0|)e(Q_®|l)<l|), 

where Q+ and Q- are spectral projections of 7. Since a 
commutes with |0)(0| and |1}(1| we have: 

Pj(p ® a)Pj = Q+pQ+ ® |0> <0|<0jcr|0> 8 
Q_pQ_ ®|1)(1|(1|(7|1). 

The eigenvalues of the first component in this direct sum 
belong to T J + , those in the second to T-L . If E t — Ej = E 
the quotient of any eigenvalues of Q+pQ+ and any eigen- 
value of Q-pQ- can never exceed e~^^^^ E . Therefore 



< 



P\ 

filled. □ 



j3 implies that condition (2) in Lemma [j] is ful- 



In the sense of the definition || we have the strong state- 
ment, that the low temperature which should be attained 
in the qubit must already be inherent in the used re- 
sources. For the moment, the problem of cooling seems 
to be circular and one might ask, why cooling is possible 
at all. 

We will show that there is an easy answer, since ar- 
bitrary low relative temperatures can be obtained by 
composing many objects deviating from their equilibrium 
state. In particular, the composition of two objects 0\ 
and O2 being in their thermal equilibrium states for the 
inverse temperature Pi and P2, respectively can contain 
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inverse temperatures larger than fix and (3%. This is the 
quantum analogue of the well-known fact from classical 
thermodynamics, that cooling can be driven by heat with- 
out any other energy supply. This principle is used in an 
absorption heat pump for instance. 

This indicates that the calculation of the relative tem- 
peratures obtained by composing objects might give in- 
teresting insights in the problem of 'the origin of low 
temperatures'. We will develop a quite general theory of 
relative temperatures in composed systems, but we will 
restrict our investigations to the case that the density ma- 
trices of the considered systems are diagonal with repect 
to any basis diagonalizing the Hamiltonian. Furthermore 
we will restrict the class of allowed transformations to 
those which permute the basis states. We will call this 
the 'quasi-classical case' and define: 

Definition 4 

1. A quasi-classical (1-level) system is described 
by a vector g £ M 1 defining the probabilities for find- 
ing the system in one of the states {1, . . . , I}. 

2. A quasi-classical object is a pair (p, g) where 
p eR 1 is the probability distribution of the actual 
state and g £ M 1 the equilibrium distribution. Let 
Pi and gi be the components of the vectors p, g. 

3. An allowed transformation is a permutation it 
of the states 1, . . . , I which leaves g invariant, i.e., 
flV(i) = 9i f° r ever V 1 < * < l - 

4- Composition of objects and composition and re- 
striction of systems are defined as in the quantum 
case (see Definition^), i.e., we have tensor product 
vectors describing joint probability distributions, re- 
strictions of objects are defined by marginal distri- 
butions. 

In analogy to Definition Q ; a relative inverse tem- 
perature [3i j can be assigned to any pair (i,j) £ 
{1 I\< 

Now we are able to give an example for the statement 
that the composition of an n-fold copy of the identical 
object can lead to arbitrary low temperatures as n in- 
creases: Take a system with the energy levels 0, E, 2E 
being in the statistical state p = (pi,P2,P3)- Let n be an 
odd number and set n = 21 — 1. We assume 

1 . P3 Pi , 

1 > =: a 

P2 P2 

In the n-fold composition of the object (p, <?), i.e., in 
(p® n , g® n ), we consider the following two states |1) and 
|2>: 

Let |1) be some state in which I of the subsystems are 
on the level 2E and I — 1 are in the level 0. Let |2) be 
the unique state where every system has energy E. The 
quotient of the probabilities of these two states is 



pi 

the energy difference of both is E. Hence we get the 
relative inverse temperature 

ft,2 = --(llnd + ln(p 2 /pi)), 

which tends to infinity for increasing I. 

It turns out, that the problem of determining the rel- 
ative inverse temperatures in an object composed of two 
quasi-classical ones is a geometrical one: For any pair 
(i, j) of states of the object O := (p, g) we define a vector 
Vi,j G M 2 by 

Vi,j(0) ■= (^MSi/ '9j)M{Pi/Pj))- (1) 

Note that the quotient of relative inverse temperature 
and the reference inverse temperature j3 of the pair 
is given by the tangens of the angle enclosed by Vij and 
the x-axis. In any composed object OxOwe denote the 
state by 1 and the state (j, j) by 2. We obtain 

v h2 (Oxd) = v i , j (0)+v~ i ~ j (0) 

as the sum of the corresponding vectors for the subsys- 
tems. 

If we define Vo '■= {vij(0)\l < i,j < I}, we get: The 
inverse temperatures available in the n-fold of the object 
are given by the possible values of tan (f>, where <f> is the 
angle enclosed by the vector 

n 

and the x-axis and Xi are arbitrary vectors taken from 
the set Vo- 

Therefore the problem of finding the lowest relative 
temperature in a composed object is a geometrical one. 



IV. INCLUDING THE ENVIRONMENT 

The problem of finding the lowest relative temperature 
in a given object is a little bit artificial for two reasons: 
Firstly, the optimal pair of states can only be used for 
cooling those two-level systems which have the same en- 
ergy gap. Of course it would be more natural to fix the 
required energy gap in advance. But then, in the generic 
case, one will not find any appropriate pair of states at 
all. Secondly, it does not make sense to assume that the 
two-level system and the resources must be isolated from 
the rest of the world. Since it is even impossible to pre- 
vent this systems from interacting with the rest of the 
world, it seems unphysical to forbid such an interaction 
even if it would help for cooling. 



5 



In a modified model, both shortcomings of the theory 
can be removed at once: We will investigate the possi- 
bilities of cooling a given object under the assumption 
that one can use the help of arbitrary additional equi- 
librium objects. They can be thought of as the system's 
environment, i.e., physical systems as particles and field 
surrounding the considered objects. We will assume the 
environment to be in its equilibrium state, since we con- 
sider this as its defining property: every non-equilibrium 
object would be reckoned as additional resources. 

In other words, we will investigate the 'worth' of the 
resources with respect to cooling under the assumption 
that equilibrium objects can be obtained for free. There 
are two reasons why the inclusion of ancilla equilibrium 
objects may help for cooling: On the one hand, gener- 
ically the energy gaps of the resources pure states will 
not coincide with the energy difference of the two-level 
system. Then, an additional equilibrium object with an 
appropriate level structure enables to perform nontrivial 
transformations at all. On the other hand, a cooling pro- 
cess driven by heat without any other energy supply is 
only possible with the use of objects having the (lower) 
reference temperature. Loosely speaking: The Second 
Law states that 'heat without cold is worthless' for driv- 
ing any process. 

In order to avoid unnecessary mathematical compli- 
cations, we will assume the ancilla objects to be finite 
dimensional quantum systems. This should not be con- 
sidered as an essential restriction, since we are only inter- 
ested in statements which do not refer to any particular 
level structure of the ancillas. Furthermore, in some sense 
the infinite dimensional case is included in our analysis, 
since we allow sequences of systems with growing dimen- 
sion as environments. 

We shall see that the help of an appropriate environ- 
ment is so useful, that in the quasi-classical case even 
every non-equilibrium object enables cooling. For the 
quantum case we will show, that an object enables cool- 
ing if and only if the time average of its state differs from 
the equilibrium state. The time average p is defined by 

p := lim - / e- lHs pe lHs ds. 

t^oo t J Q 

It is given by 

3 

where Pj are the spectral projections of the system's 
Hamiltonian H (and the corresponding equilibrium state 
7). This can be seen by theorems from ergodic theory on 
Hilbert spaces |1| : By taking the trace as an inner prod- 
uct on the space of matrices, the time evolution is unitary 
on the density matrices and the map p 1— > PjpPj is the 
orthogonal projection on the eigenspace of the generator 
i[H, .] with eigenvalue 0. 

If an object enables cooling of a qubit having envi- 
ronment's temperature, it is natural to ask whether the 



object allows cooling even if the qubit is already colder 
than the environment. 

We define the lower (respectively upper) limit temper- 
ature of a resource object as the greatest (respectively 
lowest) initial temperature of the qubit such that cooling 
(respectively heating) is just impossible. Note that the 
reference temperature, i.e., the temperature of the used 
ancilla objects, is fixed however. We will introduce a pa- 
rameter which will turn out to determine the lower and 
the upper limit temperatures at once: 

Definition 5 For any object O :— (p, 7) and any pair 
K); \j) °f eigenstates of 7 with eigenvalues A,-,Aj we set: 

f\i),\j)(°) ■=M i \p\ i ) - Mj\p\j) - hxXi+lnXj. 
Then we define the maximal diagonal deviation from 
equilibrium as 

D(0):=m a x{\f ]i)iW \}, (2) 

where the maximum is taken over all pairs of eigenstates. 

Obviously D(0) = if and only if the diagonal en- 
tries of p agree with the entries of the equilibrium state 
with respect to every basis diagonalizing 7. This jus- 
tifies the terminology. Easy considerations show, that 
Z?((p, 7)) = if and only if p = 7, where p is the time 
average of p. Furthermore we have the following refor- 
mulation of Definition ^[ 

Lemma 2 Let O := (p, 7) be an arbitrary object and Pi 
be the spectral projections of 7 for the eigenvalues Xi . 
Then the maximal diagonal deviation is given by: 

D(0) = 

max{|m(||PpP||) + lndK^-pP,-)" 1 !!) - In A, +lnA J |}, 

* ,3 

where (.) denotes the pseudoinverse of any ma- 
trix and \\.\\ is the operator norm defined by \\a\\ := 
max^Hlaxll/lla;!!} where \\x\\ is the euclidean norm of the 
vector x. 

Proof: Obviously, Definition || can be reformulated 

as 

D(0) = 

max max {| ln((i/j|p|i/>}) — ln((0|p|0)) — In A, + In Ad}, 

hi M,\<f>) 

where and \(j>) have to be eigenvectors of 7 corre- 
sponding to Xi and Xj, respectively. The term in the 
braces is maximized if is the eigenvector of PtpPi cor- 
responding to its largest eigenvalue and \4>) corresponding 
to the smallest eigenvalue of PjpPj. But then one has: 

(</>|p|V) = \\PipPiW 

and 

{<j)W) = \\{p ] pP j )- 1 \\-\ 

□ 

The maximal diagonal deviation is a superadditive 
quantity, for quasi-classical objects it is only additive: 
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Theorem 2 We have 

D(0 x 6) > D(0)+D(0) 

for arbitrary objects O := (p, 7) and O := (p, 7) with 
equality i£] p commutes with 7 or p commutes with 7. 

Proof: For the object O let |i) and \j) a pair of eigen- 
vectors of 7 maximizing the expression (Q) and for the 
object O let |Z) and \k) be such a maximizing pair of 
eigenvectors of 7. Then we have 

f\i)®\l),\j)®\k) = ^I»>.Ij> +/|0.l*>- 

In case that the signs of the two terms on the right 
hand side do not agree, we can change it by exchanging 
\l) and \k) due to the antisymmetry of /. 

Assume that p commutes with 7. Let Qi be the spec- 
tral projections of 7(8)7 with the corresponding eigenval- 
ues Write Qi as 

where Pf and Pf are spectral projections of 7 and 7 with 
eigenvalues X\ and A; (respectively) such that A; A; = pi. 
For any matrix let a denote its pseudoinverse by a -1 . 
Due to Lemma |^ the maximal diagonal deviation can be 
written in the form 

D(0 x6) = 
max (| In \\Q. t {p ® p)Q,\\ + In \\(Qj(p <g> p)Q j r 1 \\ 

- ln(^) +ln(^)||. 

From [p, 7] = we conclude P/pP,^ = for I ^ to. 
Hence we have: 

D(0 x 6) = max || In || © ; ((P/pP/) ® (P/pP/)) || 

+in|i© ; ({PipPir^iprpPir^w 

- \npi + lnp,j\\ 

= max { I in max || (P/pP/) ® (P/pP/) || 
+ In max 1 1 (P/ pP/ ) ~ 1 <g> (P/ pP/ ) " 1 1 1 

- In p ; + lnpj|| 
<D{0)+D{6) 

□ 

Since the maximal diagonal deviation vanishes for ev- 
ery equilibrium object we have: 



Corollary 1 The maximum diagonal deviation is stable 
with respect to a composition with arbitrary systems in 
its equilibrium state, i.e. we have 

D{0 x e ) = D{0) 

for every object O and every equilibrium object O e := 
(7,7)- 

Despite the fact, that the quantity D is not additive 
in general, its asymptotical increase for composition of a 
large number n of identical objects is of the order n: 

Lemma 3 Let O be an arbitrary object. Then 
hm^l 

n — >oo 71 

exists. 

Proof: Set O := (p, 7) and f(n) := D(O n ). Firstly we 
show that the sequence f(n)/n is bounded from above: 
Due to Lemma || and the triangle inequality one has 

£»(0»)<|ln||p®»||| + |m||p-"]]| 
+ |ln||7®"||| + |ln||7- n ||| 
= n(|ln||p||| + |ln||p||| + |ln||7||| + |ln||7|||) 
Due to the superadditivity of D one concludes 
f(lm + r)> //(to) + f(r) Vto, !,reN. 

Now let to be fixed. For any n define /„ :— \_(n/m)\ and 
r„ := n — ml n , hence n — l n m + r„, where [-J denotes 
the integer part of a real number. We have 

f{n) _ f{l n m + r n ) l n /(to) + f(r n ) 
n l n m + r n ~ l n m + r n 

Since the right hand term tends to f(m)/m for n — » 00, 
we conclude, that no accumulation point of f(n)/n can 
be smaller than /(to)/to. Because to is arbitrary, f(n)/n 
can have only one cumulation point. □ 

The maximal diagonal deviation can be interpreted ge- 
ometrically: For any pair |i) and \ j) of states set 

p % := (i\p\i) and p := (j\p\j). (3) 

Then we consider the vector 

viAo) 

defined as in equation (Q) and note, that D is given by 
maximizing the length of the projection of the vector 
Vij(0) on the straight line y — —x/ (3. 

The quantity D(0) shows an interesting symmetry 
which will turn out to be important in the theory of heat- 
ing and cooling. This can be seen by the introducing the 
following terminology: 



2 There are easy examples, showing that this condition cannot be dropped: Take a qubit with diag(0,E) as Hamiltonian 
and a coherent superposition of |0) and |1) such that the diagonal entries of the corresponding density matrix agree with the 
equilibrium distribution. Hence D vanishes for this object. But the composition of two such objects has non-vanishing D. 
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Definition 6 For any inverse temperature Pi we call 
p 2 ~ 2/3 - Pi 

its complementary inverse temperature relative to 
the reference temperature /3. 

One checks easily that two qubits (with the same en- 
ergy gap) having inverse temperatures Pi and Pi have the 
same maximal diagonal deviation from their equilibrium 
state. Furthermore we find that for any temperature, its 
complementary value is available by coupling the consid- 
ered system to an equilibrium object: 

Assume we have a pair \i) and \j) of eigenstates of 
the Hamiltonian with relative inverse temperature P\i),\j) ■ 
Take a qubit with energy difference E :— 2(Ei — Ej) with 
inverse temperature p. Then we find 

/%}®|0),|j><g>|i) = 2/3 - P\i),\j), 

i.e., the complementary temperature is available for a 
pair of states with the same energy gap as the original 
one. 

As a consequence we see, that if very high relative 
temperatures are inherent in an object, then very low 
temperatures are inherent in the composition with an 
equilibrium object. Furthermore, lower and upper limit 
temperatures of any object O are determined by D(0). 
In order to state this more precisely we define: 

Definition 2 (new) Let Q be a qubit in any diagonal 
state. Let O be an arbitrary object. We say, O can be 
used for cooling or heating Q, respectively, if there is an 
equilibrium object O e such that there is an allowed trans- 
formation on O x O e x Q decreasing the occupation prob- 
ability for the upper or lower state, respectively. 

From now on we will use this terminology (in contrast 
to Definition ||) and obtain: 

Theorem 3 Let Q be a qubit in any diagonal state. An 
object O can be used for cooling and heating Q if and only 
if 

D{0) > D(Q). 

Ln the case that 

D{0) < D(Q) 

the resource O is worthless in the sense, that it can 
only be used for cooling if Q is hotter than the equilib- 
rium state and it can be used for heating if Q is colder 
than the equilibrium state. 

Proof: Let Q := (diag(s,r),diag(t,v)), where r is 
the occupation probability for the upper state. Follow- 
ing Theorem |l| we know that O x O e can be used for 
cooling if and only if there is a pair of states \i) and \j) 
in the composed system such that 



Pi/Pj < r/s, 

where we have taken the abbreviations given by equation 
(||) and Ei — Ej = E if Ei , Ej are the corresponding en- 
ergies and E > is the energy gap of the qubit. Assume 
the qubit to be colder than the environment. Then 

-D(Q) = ln(r/s) + pE. 

By definition of D we have 

-D{0 x O e ) +PE < kiipi/pj) < D{0 x O e ) + PE. 

Assume DiO) < D(Q). Using D(0 x O e ) = D{0) and 
equation (|2|) one concludes 

ln(r/s) < \n(pi/pj), 

hence cooling is impossible. Hence we see, that D(0) > 
D(Q) is necessary for cooling an already cold qubit. Sim- 
ilarly one shows, that this condition is necessary for heat- 
ing a hot one. 

Assume D(0) > D(Q). Choose a pair of states \i) and 
\j) of the object O such that 

D(0) = \\n{p i /p j )+P{E i -E j )\. 

Due to the antisymmetry of the right hand term with 
respect to i and j we can even assume 

D(0) =ln( ft /^)+/3(^-^) 
without loss of generality. By definition of D(Q) we have 

HPi/Pi) + P(E t - E 3 ) > | ln(r/s) + 0E\. 
We conclude 

ln(pi/ Pj ) + P{E t - E, ) > - ln(r/s) - PE 

and 

Infa/pj) + P(Ej - Ej + E) ^ - \n(r/s) 
E > E 

Now we take an ancilla qubit with the energies E := 
Ei - Ej + E and for the states |1) and |0). Note that 
here |1) need not be the upper state since we do not as- 
sume E > 0. In the composition of O with the ancilla 
qubit the pair of states 

and \j) <g> |0) 

have the energy gap E and for the relative inverse tem- 
perature of this pair we conclude: 

_ Hpi/pj) + PjEj ~E 3 +E) ^ - ln(r/a) 

P|i>®|l>,|j>®|0> - ^ > ^ 

Hence this pair can be taken for cooling due to Theorem 
|l|. In a similar way one can conclude that the object O 
can serve for heating. □ 
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The statement of Theorem [3| can be reformulated as 
follows: The lower and upper limit temperatures of an 
object O are given by the temperatures of the two di- 
agonal states pi ; 2 := d*a<?(si,2, ^1,2) of the qubit Q := 
(diag(si2,ri2),diag(t,v)) with the property D(Q) = 
D{0). 

To avoid false conclusions at this point we emphasize 
that in general a single copy of an object O is not suffi- 
cient for cooling or heating the qubit down or up to the 
limit temperatures if the latter has the reference tem- 
perature initially. The limit temperatures can only be 
approached by running an infinite number of stages of 
the same cooling or heating procedure. This requires an 
infinite number of copies of the object O since the re- 
source has to be refreshed in each stage. 

This observation leads to another natural question: 
Given any object O, what is the lowest temperature of 
the qubit which can be prepared by using one single 
copy of the resource O if the initial state of the qubit 
has the reference temperature. One can formulate this 
problem more generally: Assume we have an object O 
and any other system being in its equilibrium state 7 
initially. Which states 7 of the latter system can be pre- 
pared by coupling it to the object O and arbitrary ancilla 
equilibrium objects? With other words: Which objects 
O := (p, 7) can be obtained with the help of the resources 
O? This leads straightforwardly to a relation which is like 
a quasi-ordering on the set of objects, which we shall call 
the conversion order. 

Definition 7 We say 'the object O := (p, 7) can be ob- 
tained by using the resource O := (p, 7) formally written 
as 

o>6, 

if there exists a sequence of equilibrium objects O e n := 
{in, In) and a sequence of allowed transformations u n on 

O x O e , n x 6 

such that 

lim tr 12 (u n (p ® % ® 7X) = p, 

n — >oo 

where tr\2 denotes the partial trace over the left most and 
the middle component in the tensor product. 

It is easy to give the following necessary condition for 
0>6: 

Theorem 4 Let Ti. and Ti be the Hilbert spaces corre- 
sponding to the objects O :— (p, 7) and O :— (p,7), re- 
spectively. If O > O then there is a completely positive 
trace preserving map G from the set of density matrices 
on Ti to the set of density matrices on Ti satisfying 

Gp = p and G7 = 7 (4) 

as well as the covariance condition 



[H,G(.)] = G([H, .]), (5) 

where H and H are Hamiltonians corresponding to the 
equilibrium states 7 and 7, respectively. 

Proof: For every equilibrium object O etU := (7^,7™ ) 
and every allowed transformation u n on O x O e ^ n x O we 
define 

G n (a) := tr 12 (u n (a ® % <8 7X) 

for every density matrix a on Ti.. Every G n is a com- 
pletely positive trace preserving map satisfying G n (7) = 
7 since conjugation by u n preserves the equilibrium state 
of the total system. Since the set of completely positive 
trace preserving maps for given spaces Ti and Ti is com- 
pact, the sequence G n has a convergent subsequence. Let 
G denote its limit point. Obviously we have G{p) = p and 
G( 7 ) = 7. The covariance condition [H, G(.)] = G([H, .]) 
follows easily from the fact that the allowed transforma- 
tion commutes with the free evolution of the total system 
and preserves the equilibrium states in every tensor com- 
ponent. □ 

In the following we will try to work out the conversion 
order as explicitly as possible. We start by doing this for 
quasi-classical objects. In this case the quasi-ordering 
can be given explicitly: 

Theorem 5 Let O := (p,g) and O :— {p,g) be quasi- 
classical objects. Then 

o>6 

if and only if there is a stochastic matrix A such that 

Ap = p and Ag = g. (6) 

Proof: Assume O > O. Let p, 7,^,7 be the density 
matrices with diagonal entries p, g,p, and 5, respectively. 
Assume p, g £ E' and p, g £ M ; . For any density matrix 
a acting on C' we define the vector q(a) £ M. 1 by the 
diagonal of a. For every i < I define the density matrix 

ei := diag(0, . . . , 0, 1, 0, . . . , 0) 

where the entry '1' is on position i. Define a stochastic 
I x Z-matrix A by 

Ar = q(G(^2nei)), 

i 

where is the z-th component of an arbitrary vector 
r £ M 1 . Obviously, both equations in (^) are fulfilled. 

Assume there is a stochastic matrix A such that Ap = p 
and Ag = g. 

For every n £ N choose the environment 







(g® n ®g® n ,g® n ®~g® n ). 



Assume pel 1 and p e Mr. Hence the pure states of 
the systems described by g and g can be named by the 
symbols 1, . . . , I and the symbols 1, . . . , I, respectively. 

Let S n be the set of pure states in the composed system 
described by the equilibrium state 

g <g> g® n <g> g® n (g> g. 

Every element of S n is characterized by a word of length 
n + 1 over the alphabet {1, . . . , 1} and a word of length 
n + 1 over the alphabet {1, . . . , I}. In the following, only 
four attributes of these word pairs are relevant: 

1. the first symbol of the first word, denoted by j. 

2. the numbers of occurrences of the symbols 1, . . . , I 
in the first word, denoted by r\ , . . . , r; , or simply 
by the vector r £ N l with J^i r i = n + 1. 

3. the numbers of occurrences of symbols 1, .. . ,1 in 
the second word, denoted by si, . . . , sr or the vec- 
tor s e N'~ with J2i St = n + 1. 

4. the last symbol of the second word, denoted by x. 

Hence we assign the 4-tuple (j, r, s, a;) to every pair of 
words. Now let n,r,s be fixed. Note that all the states 
with a common vector r and s have the same energy. 
We write {(j, r, s, .)} for the cylindric set of states hav- 
ing j, r, s as the first three attributes. Their numbers 
of elements are given by a product of two multinomial 
coefficients 



rj nl (n + 1)! 
"■' " 1L ,<>.'! II, ,K>! 



h, := 



(7) 



Accordingly, write {(.,r,s,x)} for the set of states with 
r, s, x as the last three attributes. Their numbers of ele- 
ments are given by 



(n + 1)! s x nl 
IW^IIW*)! 



(8) 



Note that these sets depend on the number n, i.e., the 
size of the environment, although we do not indicate this 
explicitly by indices. 

Let a x j with j < i, x < I be the entries of the matrix 
A. Now we define for each x the / numbers 



<:rj 



i<3 



where [-J denotes the integer part of a real number. 

For each j choose I disjoint sets M x j C {(j, r, s, .)} with 
m x j elements. This is possible since 



Note that we do not indicate explicitly that the num- 
bers b well as the sets M x j depend on (r, s). 
Choose an injective map 



such that 



tt TiS : U x jM xj -> {(.,r, s, .)} 



TT r:S (M X j) C {(.,r,s,x)}. 



This is possible since ^2jni X j < c x . Extend 7r rs to a 
bijection 

7r rjS : {(.,r,s,.)} -> {(.,r, s, .)}. 

Now perform such a transformation n rtS on every set 
{(.,r, s, .)} C S n - For every n, this defines a bijection 

Let P n be the probability measure on S n defined by 
the composed system's initial state 

P®g® n <8> g® n ® g. 

Let P n be the image of P n under the transformation 7r„, 
i.e., 

P n := Pnon- 1 . 

Let T n C {(r, s) e N' xN [ | ^r,. = n + l, £ s 4 = n + 1} 
be a such that 

lim X ^({(.,r s * s .)}) = 

(r,s)ET„ 

lim X ^({(.,r, S ,.)} = l 

(r,s)ET„ 



and 



lim max {||- - 5 || + ||£ - £||} = 0. 



This is possible due to the law of large numbers, since 
the words with 

||--fl||«0 and \\--g\\*>0 
n n 

are typical (c.f. pi). 

Now we have to show, that asymptotically the prob- 
abilities of the symbols 1, . . . , I in the right most com- 
ponent of the system are changed from gi , . . . , gj to 
Pi, . . . ,pr by the permutations 7r„, i.e., we must show 



-> Px- 



We do this by proving 

P n ({(.,r,s,x)}) p x 



(r,s)eT n P n ({(., r ,3,v)}) Pv 
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With respect to the initial probability measure P n every 
word pair with attributes (j,r,s,x) has the probability 



Pj_ 

9j 



i<l 



n 



a] 



(9) 



If for every n our attention is only restricted to those 
vector pairs (r, s) which are elements of T n , we have the 
following asymptotic statements as n goes to infinity: 

1. The quotients c x /bj tend to g x /gj and bj/bi tend 
to gj/gi due to equations (0) and (j|). 



2. Therefore m X j/bj 



This follows from 1 by 



induction over j because . a X jgj — g x . 

3. The set {(j, r, s, .)} is more and more exhausted by 
U x Mxj in the sense that the number of elements of 
its complement becomes negligible compared to the 
number of elements of {(j, r, s, .)}. This shows that 
the total probability of the complement becomes ir- 
relevant, since all its elements have the same prob- 
ability. 



We conclude: 



lim max 

n-^oo (r,s)GT„ 

lim max 

n^oo (r,s)£T n 



Pn({(- 



0» 



P n ({(.,r,s,v)}) 

l x jWj 



J2j m 



Hj " • _ Z)j a xjPj _ 

12, "■ j'h"-J Y. r "rjl'j Pv ' 



For reasons of convenience we dropped the index n for 
r,s,m x j,bj. 

The reason for the first equality is given by statement 
3. The second one is proven by the statements 1 and 2. 
The third equality is due to equation @ and the last one 
by assumption. 

The statements 1-3 reflect the following idea behind 
our construction: The part a X j of the elements in 
{(j,r, s, ■)} is mapped onto an element in {(., r, s, x)}. 
Since the ratios of the sizes of these sets behave asymp- 
totically as gj : g x , the condition J2j a xjgj = 9x 
guarantees that such a map can be constructed as 
a bijective one. For typical (r, s), the numbers of 
elements in {(1, r, s, .)},..., {(I, r, s, .)} are related to 
each other by g\ , . . . , gi and the probabilities of sin- 
gle elements in {(1, r, s, .)},...,{(/, r, s, .)} are related 
by pi/gx, ■ ■ ■ iPi/gi- The total probability of the set 
{(., ., -,x)} after having performed the transformation is 
therefore given by £\ <i, ,g,i>, //, = Y,j a x 3 Pj = Px- □ 

Loosely speaking, we have shown, that any stochas- 
tic matrix, which maps an equilibrium state of the first 
system on the equilibrium state of the second one, can 
be carried out by an energy conserving process provided 



that any ancilla system being in its equilibrium state can 
be used. 

Note that Theorem ^| shows a symmetry with respect 
to an exchange of the actual probability distribution p 
and the equilibrium distribution g: 

Corollary 2 We have: 

(p, g) > (p, g) 

if and only if 

(g,p) > (g,P)- 

The physical consequences of this symmetry are by no 
means obvious. Its investigation has to be left to the 
future. 

Due to the convexity of the set of stochastic matrices 
we conclude: 

Corollary 3 Let O be an arbitrary object. Let O := 
(p,g) and O :— (p,g) two identical systems being in dif- 
ferent states. Then O > O and O > O implies 

0>(Xp+(l-X)p,g)) 

for every < A < 1. 

Obviously, it is not satisfactory to restrict the analysis 
to the quasi-classical case. Fortunately, there are many 
cases where the investigation of the conversion order can 
be reduced to the conversion order on the quasi-classical 
objects and then Theorem ^ is used for proving consid- 
erably more general theorems. For that purpose we need 
a definition and a technical lemma: 

Definition 8 Let O := (p, 7) be an arbitrary object and 
B be a basis diagonalizing 7. Let p and g be the vectors 
given by the diagonal entries of p and 7, respectively. 
Then we define the corresponding quasi- classical object 

C B (0) := (p,g) 

with respect to the basis B. 

Lemma 4 Let O :— (p, 7) be an arbitrary object. For 
any basis B diagonalizing 7 we have 

O > C B (0). 

Proof: Let B be given by B := {|1), . . . , \l)}. Let a 
be the maximally mixed state in I dimensions. Take the 
equilibrium object O e := (cr,a). With the help of O e we 
can obtain Cb(0) by using the resources O: Take the 
initial state p (S> a % 7 of the tripartite system 7 eg) a Cg) 7 
and perform the transposition 

|i) (g> |i) (8) \k) <-> \k) <8> <8> \i), 
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where © denotes the addition modulo I. This transforma- 
tion is energy conserving since the equilibrium object is 
degenerated and the other systems have identical level 
structure. Obviously the transformation transfers the 
diagonal entries of p to the other identical system and 
destroys the coherence since the coupling to the degen- 
erated ancilla system acts like a measurement. □ 

We are now able to draw some important conclusions: 

Theorem 6 (partial converse of Theorem ^) If at least 
one of the two objects O := (p, 7) and O := (p, 7) is 
quasi-classical, i.e., 

[p, 7] = or \p, 7] = 

the following equivalence holds: 

o>6 

if and only if there is a completely positive trace preserv- 
ing map fulfilling equations (Q) and (j^). 

Proof: Let [p, 7] = 0. Then we have [p, H] = for the 
corresponding Hamiltonian. Take G fulfilling the equa- 
tions (§) and (|) of Theorem M. Then we have: 

= G([H,p]) = [H,G(p)} = [H,p]. 

Hence [p, 7] = 0. Hence it is sufficient to show the state- 
ment for the case [p, 7] = 0: 

Let Qi and Qi be the spectral projections of 7 and 
7, respectively. Then P(a) := J2i QiPQi an d P{&) '■— 
Qi<rQi project any arbitrary density matrix a and a on 
its time average with respect to the evolution generated 
by H and H, respectively. Due to the covariance condi- 
tion of G we conclude 

G(P(p)) = P(G(p)) = P(p) = p. 

Without loss of generality we can assume that 
P(p), 7, p, 7 are diagonal since [P(p), 7] = 0. For any den- 
sity matrix a acting on C m with arbitrary m let R(o~) be 
the density matrix obtained by cancelling the off-diagonal 
entries. 
We define 

G' := R0G0R. 

Due to R(P(p)) = P(p) and R{p) = p we see that G' 
satisfies the equations (||) and (||) as well. Since G' de- 
fines a map from diagonal matrices on diagonal ones it 
can be described by a stochastic matrix. Therefore we 
can apply Theorem || to show that 

(R(p),-Y) > (P,7) 

by taking the canonical basis of C' as B. Lemma ^ com- 
pletes the proof due to the transitivity of the conversion 
order. □ 

For [p, 7] = the conversion order can be reduced to 
the quasi-classical case in the following sense: 



Corollary 4 Let P(a) be (as in the proof of Theorem ^) 
the time average of any density matrix a. If [p, 7] = 
then the following statements are equivalent: 

1- := (p )7 )>(p,7) =-0 

2. There is a basis B diagonalizing 7 and a basis B 
diagonalizing p and 7 simultaneously such that 

C B (0) > C S (0). 

3. For every basis B diagonalizing P(p) and 7 simul- 
taneously and every basis B diagonalizing p and 7 
simultaneously 

c B {o) > c s {6) 

holds. 

Proof: 1 4=> 3: Like in the proof of Theorem |^ there 
is a stochastic matrix mapping the diagonal entries of 
Rb(p) on the diagonal entries of Rg(p) and the same for 
the corresponding equilibrium states. 3 2: Obvious. 
2 => 1: The stochastic C matrix mapping the diagonal 
entries of Rb{p) onto the diagonal entries of p can be 
extended to a completely positive trace preserving map 
by: 

G :=CoR B . 

Clearly G fulfills the requirements of Theorem M. □ 

For generic pairs of objects (p, 7) and (p, 7) no differ- 
ence of the eigenvalues of H will coincide with the eigen- 
values of H . One can show that in this case the condition 
[p, 7] = is necessary: 

Lemma 5 Let the energy levels of the objects O :— (p, 7) 
and O := (p, 7) be such that no energy difference in O 
coincides with any difference in O. Then O > O implies 

[p,7] =0. 

Proof: Let G be the completely positive map re- 
quired by Theorem ^| Canonically, we extend G to a 
linear map to the set of matrices acting on the corre- 
sponding Hubert space. Let \i) and \j) be eigenvectors 
of H with eigenvalues Ei and Ej. Then \i)(j\ is an eigen- 
vector of the operator [H, .] with eigenvalues Ei — Ej . 
Due to G([H,.\) = [H,G(.)] the density matrix G(\i)(j\) 
has to be an eigenvector of the superoperator [H, .] with 
eigenvalues Ei — Ej as well. But there are no eigenvectors 
with this eigenvalue by assumption. Hence G(\i)(j\) = 0. 
Hence every density matrix in the image of G commutes 
with H and 7. □ 
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Now we will show, that the problem of cooling a qubit 
is indeed a typical application of the conversion order. In 
our formal setting we can formulate it as follows: Cooling 
the qubit down to the temperature T means preparing 
the object 



0:=( 
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with 



Pi 



as well as 



9l 



I + e -E/(kT) 



I + e -E/(kT) 



and p2 = 1 — pi 



and <?2 — 1 — 9i • 



For given resources O it seems hard to decide whether 
there is a completely positive map as specified by Theo- 
rem H Fortunately the problem turns out to be equiv- 
alent to a well-known problem of testing hypotheses: If 
one wants to decide whether a given state is the state p 
or the state 7 one has to construct a measurement such 
that the measurement outcome tells whether p or 7 is 
more likely. Such a decision rule can be described by a 
positive operator valued measure (£ p ,£ 7 ) where £ p and 
£ 7 are positive operators on the resource's Hilbert space 
with £ p + £ 7 = 1 . Then the risk of the error of the first 
kind, i.e., the risk of deciding p if 7 is actual, is given by 

Fi := tr(j£ p ) 

and the risk of the error of the second kind is given by 

F 2 := ir(p£ 7 ). 

If we want to distinguish between the qubit state with 
temperature T and the state with temperature T, a 
straightforward decision rule would be given by measur- 
ing whether the system is in its upper or in its lower 
state. In the first case we will decide to have the higher 
temperature T, else we decide for T. This would be a 
decision rule with the error probabilities 



F = 



,-E/{kT) 
I + e -E/(kf) 



and F 2 = 



1 



I _|_ e -E/(kT) ■ 
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If the cold qubit has been prepared by using the resources 
(p, 7) one can define a decision rule for the distinction be- 
tween p and 7 with the same error probabilities by 

(£ p oG,£ 7 oG), 

where (£p,£ 7 ) is the decision rule described above and 
G is a completely positive trace preserving map with the 
required properties. Hence the resources (p, 7) can only 
be used for cooling the qubit down to the temperature T 
if there is a decision rule 



(£ p , £ 7 ) 

with the error probabilities given by equations (|Io[). As 
one of the main results of our theory, it turns out that 
this condition is even sufficient: 

Theorem 7 The resource (p, 7) can be used for cooling 
the qubit down to the temperature T if and only if there 
is a decision rule (£ p ,£ 7 ) with [£7,7] = such that the 
errors are given by 



Fx 



-E/{kT) 



-E/(kT) 



1 



1 



-E/(kT) 



Proof: That the condition is necessary has already 
been explained above. 

The other direction can be seen as follows: Define a 
map from the density matrices on the Hilbert space of O 
by: 



G(a) 



tr 
tr{£ p cr) 



The map G is completely positive since every positive 
map with a commutative image is completely positive. 
Furthermore it fulfills the requirements of Theorem ^ 
(Note that we have [£ 7 , 7] = by assumption) . □ 

One may question the practical importance of the con- 
verse direction which states that a cooling procedure is 
possible if the conditions of Theorem |?] are satisfied, since 
we used rather sophisticated unitary transformation in 
the proof of Theorem^. However, it is not clear whether 
a more suitable environment (e.g. an infinite dimen- 
sional one) might allow optimal transformations which 
are much more natural. Furthermore it is an important 
insight that it is not possible to derive any tighter bounds 
for the resources within our setup. 

If any resource object O := (p, 7) is given and the cri- 
terion of Theorem fj] tells that O is not sufficient for ob- 
taining the demanded temperature, it is a natural ques- 
tion whether sufficient cooling is enabled by using many 
copies of the object O. Therefore one would ask for the 
least n such that the resource object O n :— (p® n ,7® n ) is 
sufficient for preparing a qubit with temperature T. Us- 
ing Theorem |?], this is the question of the increase of the 
distinguishability between the states p®" and 7®™ (sec 
]l7t). However, it is important to note that the condition 
[£ 7 , H] = in Theorem^ differentiates the problem from 
the usual information theoretic questions. — Note that 
there can be an abundance of basis diagonalizing p®" and 
7® n simultaneously. Therefore the application of Theo- 
rem [7] is by no means easy! We will restrict our attention 
to the quasi-classical case, where we can use essentially 
Stein's Lemma 18 of classical information theory: 
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Theorem 8 For a quasi-classical object O :— (p,g) de- 
fine its Kullback-Leibler Relative Information as 



S(g II p) ■= X^ iln 



We consider the situation where the n-fold copy of this 
resources O n := (p®™, g® n ) is used for cooling a two-level 
system with energy gap E. Let T n denote the lowest ob- 
tainable temperature. Then we have: 

lim nkT n — E S(g \\ p), 

n — >oo 

where k is Boltzmann's constant. 

Proof: Let g := (31,52) be the equilibrium state of 
the qubit. Let i := (ii, . . . , i n ) 6 {1, . . . , l} n be a pure 
state in the n-fold copy of the system. Then, instead of 
working with positive operator valued measurements, we 
can specify the decision rule by the conditional probabil- 
ities 

w(l|i) and iu(2|i) = 1 - w(l\i) 

describing the probability for deciding g or p (respec- 
tively) when i is measured. The corresponding error 
probabilities are given by 



and 



F 1 ='£w(2\i)g® n (i) 



F 2 = 5>(l|i)p®»(i), 



F 1 =5>(2|i) 5 ®»(i) = l-£ w(l\i)9® n (i) 

i ieB c 

= 1 - 9i = h 
as required by Theorem ^. Furthermore we have 

F 2 = £>(l|i)p«»(i) 

< w(l|i)5®"(i)e- n(s(9!,p)_£) = gie ~ n ^^\\p)-^. 

If the decision rule w(.\.) is defined in any other way, we 
have 



F 2 > ^(l|i)p 8n (i) > w(l|i) 5 ® n (i)e-™ (s(ffl|p)+e) 
ies £ ieB e 

= i 1 ~ w ( 2 |i))5 8n (i) e"™ (5(ff 11 p)+e) 

i£B c 

> (g® n (B e ) - Fi)e _n(5(9l|p)+e) . 
With Theorem g we obtain: 



a e -n(S(g\\p)+e) > 



-E/(kT n ) 



91 



1 + e -E/(kT n ) 

> (g® n (B e ) _Fi)e- n ( s ( fl H p )- 6 ). 



Since g® n (B e ) converges to 1 and F\ is constant we get: 
lim nkT n = S(g \\ p) E. 



□ 



where we consider the vectors p® n and <?®" as probability 
measures on {1, . . . , l} n in a straightforward way. Now 
the proof goes in strong analogy to the proof of Theorem 
4.4.4 in pl| with the difference that we have a stochas- 
tic decision rule, not a deterministic one. For any e > 
define the set B e C {1, . . . , l} n by 

B t := {i| S(g || p) - e < - £ ln(^ . / Pi . ) < % || p) + e} 



Due to the law of large numbers we have: 
lim g® n (B e ) = 1 > 51 

n — >oo 

Therefore, for large n, we can define a decision rule by 

91 



w(l\i) 



Vi g B f 



and 



g® n {Be) 

w(l\i) :=0 Vi €{l,...,Z} n \B e . 



V. FURTHER APPLICATIONS OF THE 
CONVERSION ORDER 

One of the big merits of the Second Law of Thermo- 
dynamics is the restriction it puts on the efficiency of 
conversion of heat to other forms of energy: A power 
station working with two heat reservoirs having temper- 
atures T and T with T > T can never work with an 
efficiency above 

f-T 



T 

We will show in which sense our theory puts restrictions 
on the efficiency of energy conversion processes which are 
apparently not given by easy conclusions from the well- 
known laws of thermodynamics: Assume that we have an 
energy source, i.e., an object (p, 7) such that the mean 
energy of the state is above the mean energy of equilib- 
rium, i.e., we have 



We have 



tr(Hp) > tr{Hj). 
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Converting the energy to another form of energy means 
preparing another object (p, 7) by using (p, 7) as re- 
source. Generically, we will not expect that it is possible 
to undo the conversion, i.e., to prepare (p, 7) by using 
now (p, 7) as resource. In general, for a given system 7, 
we cannot expect that there is a state p such that 

(p,7) > (P,7) > iP,l)- 

We can say: The transport of the energy to the other 
system is an irreversible process so that we cannot re- 
gain the original resources. We will illustrate this by an 
easy example with two qubits: 

Take a qubit where the upper level has a higher occu- 
pation probability compared to the equilibrium: 

O ■= (0i,P2), {91,92)) with p 2 > g 2 , 

where P2 and 32 denote the occupation probabilities of 
the upper level. For another qubit described by the equi- 
librium probabilities gi and g2 for the upper and lower 
level let p2 be the largest probability such that 

(p,g) > (p,g)- 

Assume 

(p,g) > (p,g) > (p,g)- 

Then there are stochastic matrices A and B such that 

BAp — p and BAg = g. 

Since we assume p 7^ g the matrix BA must be the iden- 
tity matrix. Therefore either A and B are identity matri- 
ces or A and B are transpositions exchanging the upper 
and lower state. We can exclude the latter case since that 
would mean that g\ > cj2 if g\ < 32 or g\ < §2 if g\ > <?2 ■ 
This is not possible for any temperature. 

If B and A arc identity matrices the energy levels of 
both systems are compatible. In this case it is obviously 
possible to transfer the energy without loss. In all the 
other cases the greatest p' 2 such that 

(p,g) > fag) > ({i-p' 2 ,p' 2 ),g) 

has a value below P2, i.e., we obtain a lower probability 
for the upper level compared to the initial one. Of course 
we can not apply these arguments if many copies of these 
qubits are available. But even in this case we have the 
statement that energy conversion with lower loss requires 
processes involving more qubits at once. Hence energy 
conversion with high efficiency turns out to be a matter 
of complexity of the conversion process. 

VI. COMPARISON WITH LANDAUER'S 
PRINCIPLE 

To elucidate the connection of our analysis with Lan- 
dauer's principle we reformulate it within our framework. 



It should be emphasized that the formulation 'The era- 
sure of one bit of information requires at least the dissi- 
pation of the energy fcTln2.' has to be read in the sense 
that the bit is in a totally unknown state, i.e., the era- 
sure changes the probabilities of the state |0) from 1/2 
to 1. It is straightforward to model the bit as a two-level 
system being in its maximally mixed state initially. If 
we assume the two-level system to be degenerated then 
the erasure process fits well into our framework since 
the maximally mixed state is the equilibrium state in 
this case. Anyway, in the non-degenerate case it would 
be more complicated to see Landauer's principle since 
the two-level system may supply the energy required for 
its own erasure. The requirement of the energy supply 
fcTln2 should be made more precisely: Of course this 
energy cannot be supplied by the heat of an reservoir 
having the temperature T, since heat is a useless form of 
energy. We rather need free energy for driving the pro- 
cess. Therefore, we need resources (p, g) such that the 
free energy of p exceeds the free energy of g at by least 
fcTln2. Note that this difference of the free energies of 
p and g is given by the Kullback-Leibler Information up 
to Boltzmann's constant: 

The free energy of any state p with respect to the in- 
verse temperature /3 is given by: 

F g (p) := E g (p) - ~S(p), 

where S(p) := — J^pjlnpj is the entropy and E g {p) is 
the mean energy in the state p (in view of the energy 
level structure defined by g), i.e. 

E g(p) 

i 

Easy calculation shows the following well-known result: 
F g (p) - F g (g) = ±£> HPi/gd) = ~ p S(p II 9). 

Note that here (in contrast to Theorem |J) the relative 
information S{p\\g) instead of S(g\\p) occurs! There- 
fore, we rephrase Landauer's principle as: 'The erasure 
process (in the sense above) requires an object (p, g) with 
S(p || g) > In 2'. In order to show this, we will need the 
following Lemma: 

Lemma 6 For arbitrary objects (p, g) and (p, g) 

(p, 9) > (p, g) 

implies 

S(p\\g)>S(p\\~g) 

and 

S(g\\p)>S(g\\p). 
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Proof: It is well-known that 5(.||.) is a distance mea- 
sure on the set of probability measures which is decreas- 
ing with respect to stochastic maps (Uhlmann's mono- 
tonicity theorem Jl!|). □ 

Corollary 5 ('Landauer's principle') To obtain the per- 
fectly initialized bit p = (0, 1) from the maximally un- 
known bit g — (1/2, 1/2) one needs resources (p,g) with 
S(p\\g) >ln2, i.e, 

(p, g) > lp, g) 

implies 

S(p\\g) >ln2. 

This can be seen by checking the equality S(p || g) — 
In 2. 

Corollary 6 ('Perfect erasure is impossible with generic 
resources') Let p be a state with pi ^ for every i. Then 
there is no n such that 

(P® n ,g® n ) > (P,g), 

with p = (0, 1) and g = (1/2, 1/2)). This can be seen by 
S(g\\p) = o ^S(g® n \\p® n ). 

Note that Landauer's principle is arguing with the rel- 
ative entropy S(p \ \ g) whereas our analysis uses S(g\\p). 
This exchange of the role of p and g is more important 
than it seems: Both quantities measure the distance from 
the equilibrium state, but with respect to the first dis- 
tance measure the states p := (e, 1 — e) and p' :— (1, 0) 
have almost the same distance from equilibrium if e is 
small. In contrast, the distance measure obtained by ex- 
changing the role between p and g, converges to infinity 
as e tends to 0. 

Therefore, S(g || p) seems more appropriate for describ- 
ing the difficulties in approaching the absolute Zero! In 
other words: The usual thermodynamic quantities like 
energy, free energy and entropy cannot explain 'the hard- 
ness of the struggle against the last milli-Kelvin above the 
absolute Zero'. 



VII. WHAT IS THE KULLBACK-LEIBLER 
INFORMATION OF A TYPICAL ENERGY 
SOURCE? 

One may rephrase our results by the statement 'for reli- 
able bit erasure one needs much more than the free energy 
ln2fcT'. But this formulation is misleading: Even for ar- 
bitrarily reliable bit erasure, there cannot exist any lower 
bound tighter than the one given by Landauer: If the re- 
source's state is a pure one it may enable perfect erasure 
even with the free energy In 2kT . This statement is triv- 
ial since any qubit can be prepared into a perfect pure 



state if the resource is given by a qubit with the identical 
energy gap as the first one being in a pure state. This 
example seems to be unserious since it shifts the problem 
of cooling to the problem of supplying resources with the 
same temperature. Furthermore the problem of cooling 
seems to have a circular logical structure. Nevertheless 
the example shows, that any statements about tighter 
bounds have to refer to particular assumptions about the 
statistical properties of the energy source's state. 

In view of this, we rephrase our results more carefully: 
Given any resource object O :— (p, 7) with the property 
that no eigenvalue of p is 0. Then an arbitrarily reli- 
able bit erasure process requires arbitrary many copies 
of O, i.e., we need the resources O n with appropriately 
large n, even if the free energy of O m exceeds InfcT al- 
ready for a considerably smaller number m. For making 
definite statements about n and m one should make as- 
sumptions about p and 7 and fix the demanded error 
probability. Deriving statistical properties of the states 
of realistic energy sources is not easy and should be a 
subject of further research. However, from a quite fun- 
damental point of view, it is quite natural to ask for the 
'thermodynamic worth' of a heat source with respect to 
good cooling and reliable bit erasure: We assume that 
the resource's state p is a thermal equilibrium state with 
temperature T > T. This assumption is an example of a 
non-circular way of treating the problem of the required 
resources: The resource's state is prepared by controlling 
macroscopic quantities (in our example the temperature) 
without any direct possibility of controlling its micro- 
physical state. We show that in our example the relative 
information can be calculated explicitly if the partition 
function of the energy source is known: 

Lemma 7 Let (p, g) be an object where p is an equilib- 
rium state for the inverse temperature (3 and g is the 
equilibrium state for the environment's inverse tempera- 
ture p. With the partition function 

Z((3) 

i 

we have: 

Pi = e-P E */Z(P), 9i = e-P/Z(p). 
Hence we get the Kullback-Leibler Information 

s(g \\p) = ^gi^{gi/Pi) 

= )xxZ0)-\nZ{f3)+E g {g)0-(3), 

where E g (g) is the mean energy of the equilibrium state 
g. This term is clearly finite for p and (3 being finite. 
Hence the required number of copies of the heat source 
(p, g) for cooling down to the demanded temperature can 
be estimated by knowing the temperatures and the parti- 
tion function. 
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VIII. CONCLUSIONS 



tr{AuBu*) > tr(AB). 



To investigate the problem of cooling from a quite fun- 
damental point of view our model includes the driving 
energy source as a quantum system with density matrix 
describing its statistical state. This setup elucidated the 
lacks of the traditional thermodynamic laws for explain- 
ing the resource requirements for cooling processes ap- 
proaching the ground states: It is by no means sufficient 
that the energy source is able to supply enough free en- 
ergy, it rather is necessary, that the density matrix of 
the energy source has a large enough distance from its 
equilibrium state in another information theoretic sense. 
One has to distinguish between two different questions: 
Firstly one wants to determine whether a qubit can be 
cooled down even further if it is already colder than the 
environment's temperature. This problem turned out 
to be essentially a geometric one and we have shown, 
that the limit temperature at which every cooling pro- 
cess breaks down is given by a simple parameter which we 
called 'maximal diagonal deviation' from equilibrium. If 
one starts with a qubit having environment temperature, 
this limit temperature can in general only be approached 
by repeating a cooling procedure with refreshed resources 
at each cycle. 

The second problem is to determine the temperature 
which can be obtained by starting with a qubit with the 
environment's temperature if no such refreshment of the 
resources is allowed. Here the determination of the possi- 
bilities of cooling is essentially equivalent to the determi- 
nation of an optimal decision rule which can distinguish 
between the resources density matrix and the correspond- 
ing equilibrium density matrix. This result strongly em- 
phasizes the fact that information theoretical arguments 
can rule out physical processes in a way which goes far 
beyond usual entropy arguments. 

In a straightforward way, our theory applies to the 
more general question of the resources needed for prepar- 
ing approximately pure states in any multi-level quantum 
system. This justifies the quite general formulation of the 
title: The thermodynamic costs of reliability. 

IX. APPENDIX 

For the proof of Lemma [l] we need the following tech- 
nical lemma: 

Lemma 8 Let A :— diag(a\, . . . , a n ) with a\ > ai > 
. . . > a n and B := diag(bi, . . . , b n ) with 6j = — 1 for i < I 
and bi = 1 for i > I. 

Let u be an arbitrary unitary operator. Then 



Proof: We have: 

tr(AuBu*) -tr(AB) 
E"'E''< : "■" ~ 

3 i 

where we have used Y) ^ \uji\ 2 = 1 since u is unitary. The 
term in equation ( |TT| ) reads as: 

3 i^J 
In n I 

E m 2 2+ E ^Em 2 (- 2 )^ ( 13 ) 

j=l i=i+l j=l+l i=l 

In n I 

2a*E E M 2 -2a/+i E E M"' ( 14 ) 

3=1 i=l + l j=l+l i—1 

This term is greater or equal than zero since the dou- 
ble sums are the same: Because u is a unitary operator, 
the row square sums as well as the column sums equal 1. 
Therefore, 

In n I 

E E M a = i-£iV= E Em 2 - 

j + l i=l + l i,j<l j=l + l i=l 

□ 

Now we are able to prove Lemma 
Proof: u commutes with 7 ® a and hence with its 
spectral projections. Therefore we have 

u = Uj with uj := PjuPj. 

Then it is sufficient to show 

tr(ujO>Uj(l <S> a z )) < tr(PjaPj(l ® a z )). 

Since 

tr(ujCt*Uj(l ® cr z )) = tr(ujau*Pj(l ® a z )Pj), 

we can reduce the problem completely to the situation of 
Lemma ^| by considering the range of every Pj separately: 
Restricted to the range of Pj, the operator Uj acts as a 
unitary one. □ 
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